Speech Processing
نویسنده
چکیده
The processing of speech signals has a long and venerable history. As early as 1770 Wolfgang von Kempelen demonstrated his mechanical talking machine to the courts of Europe. In 1928 Homer Dudley invented the “vocoder” (voice coder) arguing that speech is specified by a few slowly varying parameters requiring only a fraction of the telephone bandwidth for transmission. A digital vocoder was first put into service in World War II for a secure telephone link connecting Roosevelt, Churchill and major military commands around the world. Exploiting properties of the human ear, such as “phase deafness” and auditory masking, perceptual coders have been demonstrated that transmit speech (and even high-quality music!) at fractional bits per sample. Applications for mobile radio, voice-email, and Internet radio abound. The success of speech recognition depends on the size of the vocabulary and the quality of the speech signal. The zero-error recognition of unrestricted, continuous speech from a noisy environment (the “electronic secretary”), however, is still in the
منابع مشابه
Auditory processing skills in brainstem level of autistic children: A Review Study
Aims: Autism is a pervasive developmental disorder. Deficit in sensory functions is one of the characteristics of people with autism, and usually these people show abnormality in processing and correct interpretation of auditory information. Also people with Autism show problems in communicating with others. This review article deals with the accurate understanding of Auditory processing skills...
متن کاملA Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation
Abstract Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...
متن کاملDesigning and implementing a system for Automatic recognition of Persian letters by Lip-reading using image processing methods
For many years, speech has been the most natural and efficient means of information exchange for human beings. With the advancement of technology and the prevalence of computer usage, the design and production of speech recognition systems have been considered by researchers. Among this, lip-reading techniques encountered with many challenges for speech recognition, that one of the challenges b...
متن کاملDeveloping a Semantic Similarity Judgment Test for Persian Action Verbs and Non-action Nouns in Patients With Brain Injury and Determining its Content Validity
Objective: Brain trauma evidences suggest that the two grammatical categories of noun and verb are processed in different regions of the brain due to differences in the complexity of grammatical and semantic information processing. Studies have shown that the verbs belonging to different semantic categories lead to neural activity in different areas of the brain, and action verb processing is r...
متن کاملبررسی درک گفتار با فشردگی زمانی در سالمندان
Objectives: Most of the studies performed on aging and auditory system have historically focused on speech perception disorders in elderly people. According to studies, speech discrimination disorders in aged people usually result from auditory temporal processing impairment. Our study was done to determine the ability of aged people to discriminate time compressed speech. Methods & Material...
متن کاملRecognizing the Emotional State Changes in Human Utterance by a Learning Statistical Method based on Gaussian Mixture Model
Speech is one of the most opulent and instant methods to express emotional characteristics of human beings, which conveys the cognitive and semantic concepts among humans. In this study, a statistical-based method for emotional recognition of speech signals is proposed, and a learning approach is introduced, which is based on the statistical model to classify internal feelings of the utterance....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999